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Abstract We present an empirical model based on the visible area covered 
by coronal holes close to the central meridian in order to predict the solar 
wind speed at 1 AU with a lead time up to four days in advance with a 1- 
h time resolution. Linear prediction functions are used to relate coronal hole 
areas to solar wind speed. The function parameters are automatically adapted 
by using the information from the previous 3 Carrington Rotations. Thus the 
algorithm automatically reacts on the changes of the solar wind speed during 
different phases of the solar cycle. The adaptive algorithm has been applied to 
and tested on SDO/AIA-193 A observations and ACE measurements during 
the years 2011 — 2013, covering 41 Carrington Rotations. The solar wind speed 
arrival time is delayed and needs on average 4.02 ±0.5 days to reach Earth. The 
algorithm produces good predictions for the 156 solar wind high speed streams 
peak amplitudes with correlation coefficients of cc ^ 0.60. Eor 80% of the peaks, 
the predicted arrival matches within a time window of 0.5 days of the ACE 
in situ measurements. The same algorithm, using linear predictions, was also 
applied to predict the magnetic field strength from coronal hole areas but did 
not give reliable predictions (cc ~ 0.2). 
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1. Introduction 

The Sun’s corona is constantly emanating magnetized plasma into space, the 
solar wind, consisting of charged particles (protons, electrons. Helium ions). The 
slow solar wind has typical speeds in the range 300-400 km s“^, whereas the 
so-called high-speed solar wind streams (HSSs) may reach speeds up to 800 km 
s“^. HSSs originate from coronal holes (CHs) on the Sun 


(Krieger, Timothy, 
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and Roelof 1973). Since the 1970s, CHs are known for being long-lived and 


rigidly rotating phenomena in the solar corona (Krieger, Timothy, and Roelof 


1973 Neupert and Pizzo 1974 Nolte et al. 1976), magnetically “open” to 


interplanetary space, allowing the solar wind particles to escape from the Sun 


(Tsurutani et al. 1995 Gosling and Pizzo 1999 Cranmer 2009). The plasma 


temperature and density in CHs is low compared to their surroundings, and thus 


they appear as dark areas in the corona (Munro and Withbroe 1972 Cranmer 


2009). They can be observed in space-based X-ray (Wilcox 1968[ Altschuler, 


Trotter, and OrraIH[1972 Hundhausen 1972) and EUV images (Newkirk, 1967 


Tousey, Sandlin, and Purcell 1968 Del Zanna and Bromage 

1999). More recent 

studies on the outflow dynamics and Doppler shifts of coro: 
the understanding of CHs being the source of the fast solar v 

nal holes improved 

dnd (Hassler et al. 

19991 |Peter and Judge[ |1999| |Xia, Marsch, and Curdtl 20031 Aiouaz, Peter, and 


Lemaire 2005) 


Studying coronal holes and their associated HSSs is an important task, since 
in combination with the Sun’s rotation they shape the solar wind distribution 
in interplanetary space and are the dominant contributors to space weather 
disturbances at times of quiet solar activity, due to recurrent geomagnetic storm 
activity. The distribution of the solar wind is also decisive for deriving the 


propagation behavior of CMEs (Temmer et al. 2011). The relationship between 


the size and location of CHs in the corona and the solar wind parameters and 
geomagnetic effects measured at 1 AU has been investigated by several groups 


(Nolte et al. 1976 

|Robbins, Henney, and Harvey| 20061 Vrsnak, Temmer, and 

Veronigl 2007a|b| 

Luo et al.\ |2008| 

Abramenko, Yurchyshyn, and Watanabe 

2009 

Obridko et al. 2009 de Toma| 

2011 Verbanac et al.\ 20111 Lowder et al. 


2014). In the early phases of coronal holes identification, CHs were visually 


tracked by experienced observers (Harvey and Recely 2002 McIntosh 2003). 


Recently, several groups tried to automate the process for the identification 
and detection of coronal holes using different approaches such as perimeter 


tracing (Kirk et al. 2009), fuzzy clustering (Barra et al. 2009), multichannel 


segmentation (Delouille, Barra, and Hochedez 2007), edge-based segmentation 


(Scholl and Habbal 2008), intensity thresholding ( Krista and Gallagherf 2009 de 


Toma 


2014). 


2011 Rotter et al. 2012) and magnetic track-boundaries (Lowder et al. 


One way to model the physical parameters of the solar wind are MHD mod¬ 
els of the corona and heliosphere, using synoptic solar magnetic field maps as 


input, such as ENLIL (Odstrcil and Pizzo 2009). Since large and long-lived 


CH structure cause clear signatures in the solar wind parameters measured at 
1 AU they can be used to derive empirical relationships between the CH area 
on the Sun and the solar wind parameters. To this aim, we extract the areas 
of coronal holes from remote sensing EUV imagery and relate these results to 
in situ measurements of the solar wind parameters. We developed an adaptive 
algorithm, using linear relations that react on the solar cycle variations. In the 
present paper we evaluate and adjust the algorithm based on observations during 
the beginning of solar cycle 24, the years 20II-20I3. In Section 2 we describe 
the data used for our study. Section 3 describes the adaptive linear prediction 
method. The method is applied to predictions of solar wind speed and magnetic 
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field strength at 1 AU. As will be shown, reliable predictions are obtained for v 
and not for B. The results of the predictions are shown in Section 4. Conclusions 
are drawn and discussed in Section 5. 


2. Data 


Our analysis is based on the following data sets: The fractional coronal hole 
areas, A (the area of coronal holes calculated as a fraction over a slice of 
±7.5°), derived from EUV images and the solar wind speed v and magnetic 
field strength B measured in situ at LI. As described in Rotter et al. (2012), 


we adjusted the coronal hole extraction algorithm to process the Ik by I k Solar 


Dynamic Observatory SDO Atmospheric Imaging Assembly AIA-193A (Lemen 
et al. ( 2Q12| )) quicklook images, accessible in near real-time at Royal Observatory 


of Belgium ROB (http://sdoatsidc.oma.be/), or at the Joint Science Operations 
Center JSOC (http://jsoc.stanford.edu/). 

For our purposes we work with images with an approximate cadence of 1 h 
using the SDO software routines provided within SolarSof t. We utilize the solar 
wind data measured in situ at 1 AU by Solar Wind Electron Proton and Alpha 
Monitor (SWEPAM ; [McComas et al. 1998) and the magnetometer instrument 
(MAG; Smith et a/.[|1998| ) on board the Advanced Composition Explorer (ACE; 
[Stone et al.\ |1998|). The hourly-averaged level-2 ACE data, available at http: 
//www.srl.caltech.edu/ACE/ASC/level2/, with a resolution of 1 h is used. We 
analyse the data available 2010 October 3 to 2013 December 31. In total, we 
processed a set of 28512 AIA-193 A images. 


3. Methods 


The main goal is to fully automate the process of identification and extraction of 
coronal hole regions in solar EUV images and to derive near real-time predictions 
for solar wind parameters at 1 AU up to 4 days in advance. The algorithm 
performs the following main steps: 


1. Ftp-download and image selection. 

2. Process selected images and derive fractional CH areas. 

3. Derive prediction parameters via adaptive algorithm. 


Step 1: Automatically download and select images for our purpose. In case of 
the SDO/AIA 193A quicklook images, we download images in quasi real-time 
with a cadence of 1 hour. The quicklook images are available with a delay of 
^3 hours after the observation. The delay is caused by data downlink and data 
preparation. Images are checked for their quality. Data including bad pixels, flare 
emission, or eclipse data are excluded from further analysis. 

Step 2: The CH extraction algorithm utilizes a histogram-based thresholding 
segmentation technique to detect the boundaries of CHs (Rotter et al. ( 2Q12| )). 
Figure [^illustrates the major steps in image processing. On the left panel a fully 
calibrated SDO/AIA-193A taken on 28 February 2011 is shown. The algorithm 
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searches for a dominantlocal minimum in the intensity distribution for the EUV 
image. On average it lies between 55 ± 15 digital numbers (DN), consistent with 
the findings of Lowder et al (2014). If the algorithm is not able to detect a 
well defined minimum in the distribution, which is the case for images where 
< 5% of the whole solar disc is covered by coronal hole pixels, the threshold 
is set to 55 DNs. In approximatly 18% of all processed images, the algorithm 
was set to the fix threshold. The resulting binary map undergoes mathematical 
morphological image processing techniques to remove small anomalies (Erosion) 
and to gradually fill small holes in-between detected CH regions (Dilation), 
which is illustrated in the middle panel. The right panel shows the original 
SDO/AIA image overlayed with the processed CH binary map. The fractional 
CH area. A, is derived inside the meridional slice ±7.5° for the EUV image 
corresponding to the solar rotation over ^1 day. All identified CH pixels inside 
the given slice are summed up and divided by the total number of pixels in the 
slice (see also Rotter et al. ]( |M^ ). Figure |2] illustrates examples of the image 
processing algorithm on SDO/AIA-193A for the beginning of the year 2011. 



Figure 1. Left: Calibrated SDO/AIA-193 A image on 28-02-2011. Middle: Resulting binary 
map using histogram thresholding. Right: CH binary map overlayed on the original AIA image. 
The yellow lines mark the derived fractional area A in the considered meridional slice [±7.5°]. 


Step 3: The forecast is based on deriving the delay between the fractional 
CH area time series and the solar wind speed V(^t^ in situ measured by 
ACE using cross-correlation. Linear least-square fits tend to underestimate and 
smooth out peak values, and are thus not well suited as a functional form for the 
forecasting procedure of peaks in the solar wind speed. To predict the solar wind 
speed, we shift the CH area time series with the derived lag r and apply a linear 
function = cq ± ciAq) (cf. Vrsnak, Temmer, and Veronig, 2007a). The 

parameters cq and ci are obtained by searching for the minimum and maximum 
values of the solar wind speeds and the corresponding CH fractional areas in 
preceding periods. We exclude values of high solar wind speeds (> 800 km s“^) 
that are connected to CH areas A < 0.1, as they most probably arise due to 
interplanetary CMEs and not due to HSSs. 

The algorithm reacts on variations in the relation between the CHs and solar 
wind speed. To account for the variations over the Solar Cycle, the algorithm 
is automatically adapted by using the information from the three preceding 
Carrington Rotations (CRs) to update the prediction parameters cq and ci and 
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Figure 2. Examples of CH detections for SDO/AIA-193 A. 


the lag r used for the predictions of the solar wind speed. These parameters are 
automatically updated by using a moving window of the duration of three CRs, 
sliding in steps of one CR. For the prediction function of a subsequent CR, we 
basically use the CH area, and solar wind speed space covered by the 
three preceding CRs. Testing showed that the adaptive algorithm performs best 
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when using the three CRs moving window. It covers a long enough time frame 
to catch solar wind speed variations caused by CHs. 

The procedure is illustrated in Figure for the solar wind speed. The dashed 
vertical line indicates the beginning of the predictions for the CR 2114 based on 
the preceding CRs 2111-2113. The cross-correlation of the fractional CH areas 
(top panel) and in situ measurements by ACE (middle panel) delivers the 
time delay (bottom left). We shift the Ap) time-series with the corresponding 
time lag r by 4.29 days. Similar to figure we applied the adaptive algorithm 
for the same time (CRs 2111-2113) on the magnetic field strength, which can 
be seen in Figure For B we excluded values > 15nT that are connected 
to CH areas A < 0.1. The red rectangle covers the boundaries used for the 
prediction functions (bottom right). We exclude the ^ 1% of extreme values. 
The green bars indicate identified ICMEs at 1 AU (|Cane and Richardson (2003); 
http://www.srl.caltech.edu/ACE/ASC/DATA/level3/icmetable2.htm). 

Figure shows the evolution of the parameters derived for the prediction 
functions for each Carrington Rotation. Applying the adaptive algorithm to 41 
CRs we found an average time lag of 4.07 ± 0.07 days for v and 1.98 ± 0.03 for 
B. We used the knowledge of the average delays and set time lag boundaries for 
between 3.5 — 5 days and for B between 1.5 — 2.5 days. If the delay derived 
from the previous 3 CRs does not lie between the given boundaries the delay 
is set to the average value of 4 days in the case of the solar wind speed and 2 
days for the magnetic field strength. This only occurred twice for the solar wind 
speed and 7 times for the magnetic field parameter. 


4. Results 

In Figure]^ we show the distribution of the fractional CH areas (left columns), 
the in situ measurements by ACE for the solar wind speed (middle columns) and 
magnetic field strength (right columns) for the years 2011-2013. The yearly mean 
total sunspot numbers, Ri taken from the SILSO (http://www.sidc.be/silso/), 
rising from Ri = 56 up to Ri = 65 over the years 2011-2013, indicating the slow 
rise of solar activity of cycle no. 24. The average size of the fractional coronal 
hole area A and the magnetic field strength vary slightly over the years. The 
average solar wind speed decreases from v = 424 km s“^ to = 395 km s“^. We 
also find a change in the occurrence rate of high solar wind speeds (600 — 800 km 
s“^) which is 4.4% in the year 2011 down to 2.9% in the year 2013. We measured 
an increase of low solar wind speeds (< 350 km s“^) over the years, increasing 
from 21% in 2011 to 40% in 2013. 

Figures and show the solar wind speed v and magnetic field strength B 
predictions (red lines) based on the adaptive algorithm together with in situ 
measurements by ACE (black lines) at 1 AU for the years 2010-2013. The algo¬ 
rithm begins with the 3rd of October 2010, calculating the prediction function for 
CR2105 based on CR2102-2104. Cross-signs mark peaks in predicted solar wind 
speed and stars depict peaks in the corresponding measurements. Green bars 
depict the start and the end of identified ICMEs at 1 AU taken from Richardson 
and Cane list. The algorithm adapts to the variations of the slow solar wind 
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Figure 3. Top: CH fractional areas. Middle: Solar wind speed measured (black line) and 
predictions (red line) at 1 AU for the time range CR2111-2114. The dashed vertical line 
shows the boundary when the predictions for the following CR starts. The green bars indicate 
identified ICMEs at 1 AU (from Richardson and Cane list). Bottom left: the cross-correlation 
coefficient and the derived time lag in days. Bottom right: Scatter plot of the time lagged 
fractional CH areas and the measured solar wind speed. The prediction function (red line) and 
its linear function coefficients given as inset. 


during a year. The values of the predicted slow solar wind speed follow the 
trend in the measurements (see Fig. 0 and Fig. [^. 

Figure 1^ shows scatter plots of 156 peaks identified in predicted and measured 
solar wind speed u, and magnetic field strength B (indicated in Figures [7][^ for 
the years 2011-2013. The peaks were automatically determined by scanning for 
local maxima in the extracted fractional areas A data that exceed a value of 
0.15. It automatically searches for the closest peak in the measured solar wind 
data in a given ±3 days window. 

We derived correlation coefficients cc for every year separately for the peaks in 
V as well as B. For the solar wind speed we derived cc^ 2011 = 0.77, cc^ 2012 = 0.72 
and cc ^2013 = 0.40. Correlating all measured versus predicted peaks we found 
ccy total = 0.61. For the magnetic field strength we found lower correlations with 
CCB2011 = 0.20, CCB2012 = 0.10 and ccb 2013 = 0.33 and a ccBtotai = 0.15. 

The histograms in Fig. depict the differences in the arrival time between 
measured and predicted peaks. The average prediction difference for v lie at 
—0.05T0.51 days. The histogram reveals that for 80% of the HSS peaks identified. 
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CR 2111-2114 



Cross Corr delay = 2.50, cc = 0.33 




Delay [d] A 


Figure 4. Same as Fig |^for the magnetic field strength. 


the predicted arrival times lie within ±0.5 days of the observed peak arrivals. 
For B we found an average of 0.23 ± 1.53 days, but the histogram is broadly 
distributed over ±3 days. 


5. Discussion and Conclusion 


The adaptive algorithm delivers reliable predictions for the peaks in the undis¬ 
turbed solar wind speed (i.e. not affected by ICMEs) with a lead time of 4 days. 
The prediction accuracy of the HSS arrival times lies within ±0.5 days in 80% 
of the cases. The predicted amplitudes reveal a correlation of cc = 0.6 over the 
three years under study. Predictions for the solar wind speed peaks at 1 AU 
based on this empirical model are thus comparable to numerical models such as 


2014) 


ENLIL and WSA (tested for the year 2007 by Gressl et al 

Gressl et al. ( |2014[ ) performed a detailed study of ENLIL model runs with solar 
wind parameters at 1 AU for the year 2007. The numerical solar wind models 
produce the best results for the parameter solar wind speed. During times of 
low solar activity the numerical models predicted the arrival times of HSSs to 
have typical uncertainties of approximately 0.5-1.5 days with cross-correlation 


coefficients in the range of 0.5 to 0.7 (see also Lee et al 2009 Jian et al 2011 


Gressl et al. 2014). Also, the outcome for the numerical models was found to be 
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Figure 5. Minimum and maximum values of the prediction function (see Figures and 
of the solar wind speed measurements (top), total magnetic field strength (middle) and the 
corresponding fractional CH areas in the sliding windows covering 41 consecutive CRs. Number 
of ICMEs per CR (lowest panel). 


dependent on synoptic magnetic map used as an input as stated by [Riley et aL\ 

mm 

In the case of the solar wind magnetic field strength, the adaptive algorithm 
using linear prediction functions, does not work. The algorithm underestimates 
the magnetic field strength. In our current algorithm we derive the magnetic 
field by observing the sun in EUV and excluding any further information. It was 
shown that the primary driver of geomagnetic disturbances is the coupling of 


B and v ( jSiscoe and Crooks 1974 Crooker, Feynman, and Gosling[ 1977). vB 
(related to the electric field in the current sheet), vB^ (Poynting fiux) determine 


the level of geomagnetic activity (Perreault and Akasofu 1978| ). v^B‘^ describes 


the energy coupling function for the magnetospheric substorm and the delivered 


power (see also Kan and Lee 1979). One has to consider that magnetospheric 
current systems may respond differently to the HSS related and the CME-related 


solar wind disturbances (cf. Verbanac et al 2011 2013 and references therein). 


This behavior shows that the combination of the solar-wind speed and magnetic 
field is essential to the process of energy transfer from the solar wind to the 
magnetosphere. Furthermore, numerical models such as ENLIL and WSA un¬ 
derestimate the magnetic field strength by a factor of two and the performance of 
the prediction models is very sensitive to the input synoptic map, temperature 
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Figure 6. From left to right: Probability distributions of fractional CH areas A, solar wind 
speed V and total magnetic field strength B for the years 2011 (top), 2012 (middle), 2013 
(bottom) from SDO/AIA-193A, and ACE, respectively arithmetic mean and median values 
are indicated as blue and red lines. The yearly mean total sunspot numbers (SILSO), and 
number of identified ICMEs at 1 AU (from Richardson and Cane list) as given in the insets in 
row 1. 


and density values and the computational-grid resolution used (Gressl et al 


2014 Stevens et al. 2012 Riley et al. 2012). Also, CME have huge impact 


on the interplanetary background solar-wind structure. The effects of CMEs on 
the solar-wind conditions are observed by the in-situ measurements but do not 
appear in the predictions of the background solar wind. 

Eigure[^ illustrates that unreliable solar wind predictions can be caused by 
false detections of CHs in EUV images. We show coronal holes falsely detected 
by our algorithm (left top outlined with a blue circle) for the SDO/AIA-193A 
image taken on the 5th of August, 2012. In comparison, the Spatial Possibilistie 
Clustering Algorithm SPoCA-Suite ( Delonille et al.\ 2Q12[[Verbeeck et a/. [|2Q14) 


did not identify the filament as coronal hole (Image taken from helioviewer 


SOLA: SolPhys_Rotter_2014_answer.tex; 28 January 2015; 1:34; p. 10 























































































































Solar Wind Predictions 



Jon Mcr May Jul Sep Nov 


Figure 7. Solar wind speed v measured by ACE (black lines) and predictions derived from 
CH areas detected in SDO/AIA 193 A images (red lines). Cross-signs indicate peaks in the 
predicted solar wind speed and stars indicate peaks in situ measured solar wind speed. Start 
and end time of identified ICMEs at 1 AU from Richardson/Cane list (green bars). 


(http://www.helioviewer.org/). In the bottom right panel an Ha image recorded 


at Kanzelhoehe observatory (http://cesar.kso.ac.at/synoptic/ha_years.php; Potzi, 
Hirtenfellner-Polanec, and Temm^ |2Q13 ), clearly outlines the filament giving 


evidence of the current weakness of the algorithm to distinct between coronal 
holes and filaments, as they are both characterized by low intensities in the EUV 
images (see also de Toma, Arge, and Riley 2005). We are currently developing 


a geometrical classification model that will be implemented in the algorithm to 
reduce coronal hole classification errors by taking advantage of filaments being 


asymmetric structures aligned along a preferred direction (Reiss et al. 2014). 


Schwenn (2006) discusses that during low solar activity there appear no warps 


in the heliospheric current sheet (HCS) and the HCS lies close to the heliographic 
equator. So it can occur that we observe large polar holes and predict an increase 
in solar wind speed but the HSS is actually simply “missing” Earth. EignrepT] 
displays an example of a large coronal hole in the northern hemisphere on 2nd 
of July 2013. On this image, the SPoCA-Suite agrees with the detections of the 
coronal holes identified by our image recognition algorithm. The SPoCA-Suite 
also has the problem distinguishing between filaments and coronal holes, since 
it deals with intensity values. 
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Figure 8. Same as Fig. |^but for the total magnetic field strength. 


In general we note that in cases where no ICMEs are disturbing the solar wind 
parameters, the algorithm delivers reliable predictions of the solar wind speed at 
1 AU from solar observations with an lead time of about 4 days. The algorithm is 
applied in real-time HSS forecasting, being hourly updated with the most recent 
SDO/AIA I 93 A image (http://swe.uni-graz.at/solarwind). It is a sturdy and fast 
image processing algorithm that can utilize EUV images in manner of seconds 
and works for several satellite missions and different instruments. 
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Figure 10. Left top: Coronal holes detected by the algorithm for the SDO/AIA-193A image 
taken on 5th August, 2012. Right top: Measurements by ACE (black line) and predictions (red 
line) based on A. Bottom left: Coronal hole detections based on SPoCA-Suite. Bottom Right: 
KSO high contrast Ha image. 
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Figure 11. Same as Figure showing good agreement of the used algorithm (top left) 
and SPoCA-Suite (bottom right). But for 2nd July 2013 false solar wind speeds were pre¬ 
dicted (black circle). Blue arrows point at small features wrongly detected by SPoCA and our 
algorithm. 
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